Rank in Wordlist | Frequency | Word |
---|---|---|
2471 | 92 | 1,5 |
3441 | 61 | 2,5 |
4010 | 50 | 1,2 |
5131 | 36 | 6,5 |
5730 | 31 | 1,3 |
5732 | 31 | 3,5 |
6328 | 27 | 4,5 |
6713 | 25 | 1,4 |
6925 | 24 | 1,1 |
7382 | 22 | 0,5 |
Rank in Wordlist | Frequency | Word |
---|---|---|
48664 | 1 | 6(a |
48930 | 1 | 7-6(0 |
49570 | 1 | A(idn |
49571 | 1 | A+(idn |
49599 | 1 | AA(idn |
49600 | 1 | AA-(idn |
49809 | 1 | ASII(1,5 |
51922 | 1 | BMRI(2,8 |
52028 | 1 | BUMI(7,44 |
52040 | 1 | BYAN(4,24 |
Rank in Wordlist | Frequency | Word |
---|---|---|
38662 | 2 | SBY)-Boediono |
45612 | 1 | 1)Air |
45893 | 1 | 10)--wamen |
49759 | 1 | APEC)-Bali |
49769 | 1 | APPBI)Jakarta |
52029 | 1 | BUMN)diharapkan |
53223 | 1 | Bodetabek)-yang |
53738 | 1 | CAR)perbankan |
53806 | 1 | CIS)-Rencana |
56877 | 1 | Espos)--Dewan |
Rank in Wordlist | Frequency | Word |
---|---|---|
22416 | 4 | 20%-30 |
32644 | 2 | 10%-15 |
32797 | 2 | 20%-25 |
32949 | 2 | 30%-35 |
33129 | 2 | 7%-8 |
45414 | 1 | 0,015%--0,03 |
45890 | 1 | 10%--15 |
45891 | 1 | 10%-20 |
45892 | 1 | 10%-30 |
46031 | 1 | 11%-13 |
Rank in Wordlist | Frequency | Word |
---|---|---|
8625 | 18 | S&P |
13032 | 10 | S&P 500 |
16242 | 7 | E&P |
22528 | 4 | AT&T |
23097 | 4 | GmbH & Co |
27603 | 3 | Hebel International GmbH & Co |
28909 | 3 | R&B |
28910 | 3 | R&D |
38647 | 2 | S&P/ASX |
49822 | 1 | AT & T |
Rank in Wordlist | Frequency | Word |
---|---|---|
14088 | 9 | US$1 |
15287 | 8 | US$2 |
18576 | 6 | US$10 |
18577 | 6 | US$100 |
21113 | 5 | US$30 |
24348 | 4 | US$1,1 |
24349 | 4 | US$120 |
24350 | 4 | US$200 |
24351 | 4 | US$5 |
24352 | 4 | US$6 |
Rank in Wordlist | Frequency | Word |
---|---|---|
28855 | 3 | Poor"s |
42566 | 2 | lainnya,"ujar |
46775 | 1 | 1927,"kata |
51185 | 1 | Arfa"s |
51810 | 1 | BBM,"ucap |
52041 | 1 | Ba"asir |
53767 | 1 | CEO"s |
54971 | 1 | D"Agreves |
57716 | 1 | G.I.F.)"yang |
59423 | 1 | Hills"-nya |
Rank in Wordlist | Frequency | Word |
---|---|---|
11296 | 12 | Ballon d'Or |
16587 | 7 | Poor's |
37518 | 2 | O'Brien |
50734 | 1 | America's Next Top Model |
61111 | 1 | Jeanne d'Arc |
66855 | 1 | Moody's Investors Service |
68176 | 1 | O'Mine |
68177 | 1 | O'Neal |
68178 | 1 | O'Neill |
69982 | 1 | People's Choice |
Rank in Wordlist | Frequency | Word |
---|---|---|
23119 | 4 | H+7 |
33254 | 2 | ASEAN+3 |
35281 | 2 | H+1 |
35282 | 2 | H+3 |
45613 | 1 | 1+700 |
45614 | 1 | 1+800 |
49130 | 1 | 8+350 |
49131 | 1 | 8+600 |
49571 | 1 | A+(idn |
57787 | 1 | GMT+13 |
Rank in Wordlist | Frequency | Word |
---|---|---|
73548 | 1 | SM*SH |
Rank in Wordlist | Frequency | Word |
---|---|---|
2420 | 95 | kabupaten/kota |
6566 | 26 | I/2012 |
6717 | 25 | 26/12 |
6927 | 24 | 2012/13 |
7641 | 21 | HIV/AIDS |
7877 | 20 | 14/12 |
8497 | 18 | 16/12 |
8498 | 18 | 17/11 |
8502 | 18 | 31/10 |
8504 | 18 | 4/10 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots